A Generic Tool for Creating and using Multilingual Phrasebooks

نویسندگان

  • Michael Zock
  • Guy Lapalme
چکیده

To speak fluently is a complex skill. If reaching this goal in one’s mother tongue is already quite a feat, to do so in a foreign language can be overwhelming. One way to overcome the expression problem when going abroad is to use a dictionary or a phrasebook. While neither of them ensures fluency, both of them are useful translation tools. Yet, neither can teach you to speak. We will show here how this can be done in the case of a phrasebook. Being interested in the learning of foreign languages, we have started to build a multilingual phrasebook (English, French, Japanese, Chinese) whose sentence elements, typically words or expressions, are clickable items. This fairly simple feature allows extending considerably the potential of the resource. Rather than learning merely a list of concrete instances (sentences), the user may learn in addition the underlying principles (patterns), that is, the generative mechanism capable to produce quickly analogous, but more or less different sentences. A similar feature may be used to extend the resource, by mining a corpus for sentences built according to the same principle, i.e. based on the same pattern, but this is work for the future. Two of the main goals of this paper are to present a method helping learners to acquire the skill of speaking (patterns augmented with rules), and to allow experts (teachers) to add information either to extend the database or to add a new language. We’ve started from English and Japanese, adding very quickly French and Chinese.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Transculturation and Multilingual Lives: Writing between Languages and Cultures

This paper looks at the issues of transculturation as explored in auto and semi-autobiographical accounts of linguistic and cultural transitions. The paper also addresses a number of questions about the structure of these texts, the authors’ linguistic competences, as well as questions about the theoretical and conceptual tool which may help us to discuss the issues the writers are reflecting o...

متن کامل

Voice aided input for phrase selection using a low level ASR approach - application to French and Khmer phrasebooks

We report the ongoing results of an effort to embed a “light” ASR for a future smart-phone-based multimodal multilingual phrase-book which allows users to look for a sentence by simply pronouncing it. We compared a phoneme-based low level approach with a conventional word-based high level approach. The former approach has been found promising in terms of accuracy and performance in a restricted...

متن کامل

Multilingual Information Extraction with PolyglotIE

We present POLYGLOTIE, a web-based tool for developing extractors that perform Information Extraction (IE) over multilingual data. Our tool has two core features: First, it allows users to develop extractors against a unified abstraction that is shared across a large set of natural languages. This means that an extractor needs only be created once for one language, but will then run on multilin...

متن کامل

Creating a multilingual collocations dictionary from large text corpora

This paper describes a system of terminological extraction capable of handling multi-word expressions, using a powerful syntactic parser. The system includes a concordancing tool enabling the user to display the context of the collocation, i.e. the sentence or the whole document where the collocation occurs. Since the corpora are multilingual, the system also offers an alignment mechanism for t...

متن کامل

Creating a Multilingual Collocation Dictionary from Large Text Corpora

This paper describes a system of terminological extraction capable of handling multi-word expressions, using a powerful syntactic parser. The system includes a concordancing tool enabling the user to display the context of the collocation, i.e. the sentence or the whole document where the collocation occurs. Since the corpora are multilingual, the system also offers an alignment mechanism for t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010